Cumulative Optimality in Risk-Sensitive and Risk-Neutral Markov Reward Chains
نویسنده
چکیده
This contribution is devoted to risk-sensitive and risk-neutral optimality in Markov decision chains. Since the traditional optimality criteria (e.g. discounted or average rewards) cannot reflect the variability-risk features of the problem, and using the mean variance selection rules that stem from the classical work of Markowitz present some technical difficulties, we are interested in expectation of the stream of rewards generated by the Markov chain that is evaluated by an exponential utility function with a given risk sensitivity coefficient. Recall that for the risk sensitivity coefficient equal zero we arrive at traditional optimality criteria. In this note we present necessary and sufficient risk-sensitivity and risk-neutral optimality conditions; in detail for unichain models and indicate their generalizations to multichain Markov reward chains.
منابع مشابه
Optimal Stationary Policies in Risk-sensitive Dynamic Programs with Finite State Space and Nonnegative Rewards
This work concerns controlled Markov chains with finite state space and nonnegative rewards; it is assumed that the controller has a constant risk-sensitivity, and that the performance of a control policy is measured by a risk-sensitive expected total-reward criterion. The existence of optimal stationary policies is studied within this context, and the main result establishes the optimality of ...
متن کاملRisk-Sensitive and Average Optimality in Markov Decision Processes
Abstract. This contribution is devoted to the risk-sensitive optimality criteria in finite state Markov Decision Processes. At first, we rederive necessary and sufficient conditions for average optimality of (classical) risk-neutral unichain models. This approach is then extended to the risk-sensitive case, i.e., when expectation of the stream of one-stage costs (or rewards) generated by a Mark...
متن کاملRecent Results in Controlled Markov Chains with Risk Sensitive Average Criteria: the Vanishing Discount Approach
Countable state space Markov cost/ reward chains, satisfying a Lyapunov-t ype stability condition, are considered in this work. For an infinite planning horizon, risk sensitive (exponential) discounted and average cost criteria are considered. The main contribution is the development of a vanishing discount approach to relate the discounted criterion problem with the average criterion one, as t...
متن کاملThe vanishing discount approach in Markov chains with risk-sensitive criteria
In this paper stochastic dynamic systems are studied, modeled by a countable state space Markov cost/reward chain, satisfying a Lyapunov-type stability condition. For an infinite planning horizon, risk-sensitive (exponential) discounted and average cost criteria are considered. The main contribution is the development of a vanishing discount approach to relate the discounted criterion problem w...
متن کاملControlled Markov chains with risk-sensitive criteria: Average cost, optimality equations, and optimal solutions
We study controlled Markov chains with denumerable state space and bounded costs per stage. A (long-run) risk-sensitive average cost criterion, associated to an exponential utility function with a constant risk sensitivity coe1⁄2cient, is used as a performance measure. The main assumption on the probabilistic structure of the model is that the transition law satis®es a simultaneous Doeblin cond...
متن کامل